Sound Ontology for Computational Auditory Scence Analysis

نویسندگان

  • Tomohiro Nakatani
  • Hiroshi G. Okuno
چکیده

This paper proposes that sound ontology should be used both as a common vocabulary for sound representation and as a common terminology for integrating various sound stream segregation systems. Since research on computational auditory scene analysis (CASA) focuses on recognizing and understanding various kinds of sounds, sound stream segregation which extracts each sound stream from a mixture of sounds is essential for CASA. Even if sound stream segregation systems use a harmonic structure of sound as a cue of segregation, it is not easy to integrate such systems because the de nition of a harmonic structure di ers or the precision of extracted harmonic structures di ers. Therefore, sound ontology is needed as a common knowledge representation of sounds. Another problem is to interface sound stream segregation systems with applications such as automatic speech recognition systems. Since the requirement of the quality of segregated sound streams depends on applications, sound stream segregation systems must provide a exible interface. Therefore, sound ontology is needed to ful ll the requirements imposed by them. In addition, the hierarchical structure of sound ontology provides a means of controlling top-down and bottom-up processing of sound stream segregation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining Independent Component Analysis and Sound Stream Segregation

This paper reports the issues and results of AI Challenge: \Understanding Three Simultaneous Speeches". First, the issues of the Challenge are revisited. We emphasis the importance of information fusion of various attributes of speeches (sounds) in separating speeches from a mixture of sounds. This emphasis is supported by comparing two methods of speech separation; computational auditory scene...

متن کامل

Using Eye Movement Analysis to Study Auditory Effects on Visual Memory Recall

Recent studies in affective computing are focused on sensing human cognitive context using biosignals. In this study, electrooculography (EOG) was utilized to investigate memory recall accessibility via eye movement patterns. 12 subjects were participated in our experiment wherein pictures from four categories were presented. Each category contained nine pictures of which three were presented t...

متن کامل

Independent Study Computational Auditory Scene

Aim To do a literature survey of Computational Auditory Scene Analysis and look for features or techniques t hat can be used for purposes such as discriminating a particular sound (speech in this case) from all the other sounds.

متن کامل

Neural Basis and Computational Strategies for Auditory Processing

Title of dissertation: NEURAL BASIS AND COMPUTATIONAL STRATEGIES FOR AUDITORY PROCESSING Mounya Elhilali, Doctor of Philosophy, 2004 Dissertation directed by: Professor Shihab A. Shamma Department of Electrical and Computer Engineering Our senses are our window to the world, and hearing is the window through which we perceive the world of sound. While seemingly effortless, the process of hearin...

متن کامل

Auditory Scene Analysis: Computational Models

Listeners have to make sense of a complex acoustic world containing overlapping sound sources that must be organized into individual auditory objects. Computational auditory scene analysis concerns the use of algorithms inspired by human sound perception whose aim is to extract properties of constituent sound sources in a complexmixture. Starting with representations based on models of how soun...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998